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Abstract- Biotechnological methods can be used for cryptography. This paper presents an implementation of 
steganography using DNA molecules. We first encode a plaintext message into a DNA sequence using a randomly 
generated single-substitution key using the general DNA codon table. This sequence is then mixed and inserted into 
a background DNA. 
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I. INTRODUCTION 

Before the modem era, cryptography was concerned with message confidentiality (i.e., encryption) - conversion of 
messages from a comprehensible form into an incomprehensible one and back again at the other end, rendering it 
unreadable by interceptors without secret knowledge i.e., the key needed for decrypting. DNA cryptography is one 
of the rapid emerging technology which works on concepts of DNA computing. A new technique for securing data 
was introduced using the biological structure of DNA called DNA Computing. The technique later on extended by 
various researchers for encrypting and reducing the storage size of data that made the data transmission over the 
network faster and secured. [1] S. Jeevidha et.al analyzed an existing approach to the DNA computing method and 
DNA based cryptographic approach [2], A. Atito proposed a novel algorithm to communicate data securely. The 
proposed technique is a composition of both encryption and data hiding using some properties of Deoxyribonucleic 
Acid (DNA) sequences [3]. K. Menaka proposed an algorithm using DNA sequences for data hiding is proposed 
and discussed for secure data transmission and reception [4], D. Umalkar et.al., purposed the formula to write the 
message with the premise of complementary rules deoxyribonucleic acid sequence [5]. Kritika Gupta et.al., done a 
review on DNA based cryptographic techniques [6], Eihab Bashier et.al proposed a technique of generating artificial 
DNA sequences using chaotic maps running on the domain of integer numbers [7]. M. Yamuna proposed a method 
for safe transfer of messages using periodic table [8]. Fatma E. Ibrahim et.al proposed a new data hiding algorithm 
based on DNA sequence [9]. Nirmalya Kar et.al proposed a new algorithm which has two rounds of encryption . 
Moreover paper contains a new method of key generation and sharing for better security aspects. The encryption 
scheme is designed by using the mechanism of DNA sequencing [10]. M. Yamuna et.al., developed a new binary 
periodic table and hence use this for encrypting details about any drug [11], 

II. PRELIMINARY NOTE 

In this section we provide some basic results required for the proposed method. 

A. DNA Codon Table 

The genetic code is the set of rules by which information encoded within genetic material (DNA or mRNA 
sequences) is translated into proteins (amino acid sequences) by living cells. Biological decoding is accomplished by 
the ribosome, which links amino acids in an order specified by mRNA, using transfer RNA (tRNA) molecules to 
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carry amino acids and to read the mRNA three nucleotides at a time. The genetic code is highly similar among all 
organisms, and can be expressed in a simple table with 64 entries [12] 

We use the standard codon table in snapshot - 1 [13] for our encryption scheme. 



nonpolar polar basic acidic (stop codon) 



Standard genetic code 




Snapshot - 1 



B. Insertion Method 

In [ 14 ], H.J. Shiu et al introduced the insertion method, which is wide use now. Snapshot [ 14 ] of insertion 
method is given below 



Method 1: the Insertion Method 

To simplify the discussion, the most basic version is outlined and a simple example is given. The more complicated ver 
sion of the method will be presented after the basic one is explained. All of the methods use a reference sequence. S. Suppose 
the secret message M is 01001100. Let S be ACCCTTCCMTCC. The method works as follows: 

Step 1. Code S into a binary sequence by using the binary coding rule. Thus the sequence S will now become 
0001 1010111 1010100001 11001 . 

Step 2. Divide S into segments, whereby each segment contains k bits. Suppose k is 3. Then there are the following seg 
ments: 000. 110 101. 111.010.100.001. 110.01. 

Step 3. Insert bits fn>mM.oncatatimc.intothcbcginningof$cgmcntsofS.Thcresultisasfollows:0000. 1110.0101.011 1. 
10 1 ft 1 100. 0001,01 10, 01. Those segments without any secret message inserted should be ignored. Thus, there are 
the following segments: 0000. 1110, 0101.0111,1010. 1 1 Oft 000 1 , 0 1 1 ft Concatenating the above segments results 
in the following binary sequence: 00001 1 10010101 1 1 10101 100000101 10. 

Step 4. Use the inverse function of the binary coding rule to produce the following faked DNA sequence: 
S' AArCCCCTCCTAACCC. As the reader can sec. this sequence is quite different from S. 

Step 5. Send the above sequence S' to the receiver. 



Snapshot - 2 
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III. METHODS 



A. Proposed Encryption Scheme 

The sequence of the bases A, T, G, C in DNA determines our unique genetic code. This also provides instructions 
for producing molecules. There are 20 different types of amino acids which are the building blocks of proteins. 
Different proteins are made up of combination of amino acids. The 20 amino acids are in general represented using 
64 codons. DNA sequences represent the genetic privacy in medical field. Numerous data are available in public 
domain. We use the availability of these sequences for encrypting normal data. We use the regular codon table for 
constructing a new table which can be used for our encryption method. 

B. Construction of Modified DNA Codon Table 

We use the general codon table. There are 64 entries in the table. We randomly pick any 27 combinations ( one for 
each alphabet and one for blank space ) and assign them in the table. Table - 1 represents a randomly constructed 
table. 
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Table - 1 

As seen in the table each alphabet is now represented by a codon. In Table - 1 A is represented as TTT, B as TTC 
and so on. 

In regular insertion method, we choose a random sequence S. This string S is split into segments of size k. The 
message M to be encrypted is inserted between these k bits. Since insertion is traditionally used, once the size of the 
segment is detennined, decryption becomes relatively easy. To overcome this, we adopt a random insertion rather 
then inserting the message M into all the segments. 



87 



M. Yamuna et al. / International Journal of Engineering Issues 



C. Modified Insertion Method 

We use the insertion method slightly modified as random insertion, where after dividing the segment into segments 
of length k = 3 we can insert any bit of length 3 randomly at any place. 

For example if the DNA sequence to be inserted is GATAAGGAC. 

• We pick any random sequence of codons GGTAGCAGTAATACTATTATCGTT. 

• We split them into segments of length k = 3 to generate GGT AGC AGT AAT ACT ATT ATC GTT. 

• We then insert the codons in the original sequence randomly. The codons in the original sequence is GAT 
AAG GAC. 

• This randomly inserted generates GGT AGC GAT AGT AAT AAG ACT ATT ATC GA< GTT. 

D. Encryption Algorithm 

There are 64 entries in the codon table. Hence we can use this method for encrypting any message containing a 
maximum of 64 distinct characters. Table - 1 presents a sample table for normal message encryption using alphabets 
Let M be the message to be encrypted. 

Step 1 Construct a modified codon table as in Table - 1. 

Step 2 Assign the codons to the characters in the message M using Table - 1. 

Step 3 Choose a random DNA sequence S of any length not containing the 
codons used in Table - 1. 

Step 4 Insert the codons in M randomly into the sequence S to generate a 
sequence D. 

Step 5 Send the message D to the receiver. 

E. Decryption Algorithm 

For decrypting the message we reverse the procedure. 

Suppose the received message is S. Split the sequence S into codons. Filter the codons used in Table - 1 separately. 
Convert these codons into message using Table - 1. 



IV. DISCUSSION 



A. Illustration of the Encryption Algorithm 

Let M = ALGEBRA be the message to be encrypted 

ALGEBRA 

I I I I I I I 

TTT TAG TCA TCT TTC CTC TTT 

M = TTT TAG TCA TCT TTC CTC TTT 

Let ATTATCACTACCCGGAGTAGC be the random sequence. Split this sequence into segments of length k = 3, 
to generate ATT ATC ACT ACC CGG AGT AGC. Then we insert M into this sequence randomly. The resulting 
sequence is 

D: ATT TTT ATC ACT TAG ACC TCA CGG TCT AGT TTC AGC CTC ATT TTT 



Send the sequence 

D: ATTTTTATCACTTAGACCTCACGGTCTAGTTTAGCCTCATTTTT to the receiver. 
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B. Illustration of the Decryption Algorithm 

Suppose the received message S: ATT TC ACGG T G A ACC TG AATT AGC TTG ACTCAGCGG TAGCCCAC TT 
ATCTAA. 

Split the sequence into codons. Filter the codons used in Table - 1 separately. 

D = ATTTCA CGGTGAACC ATTAGCTTGACTCAGCGG CCCACTTTAAT 
CTAA. 

S = TCA TGA TGA TTG CAG TAG CCC TTA TAA. 

Convert these codons into message using Table - 1. 
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The message is decrypted as GOOD LUCK. 

Since insertion method is modified and adopted. Once the modified codon table is decided, the proposed method can 
be programmed easily in any of the common programming languages. 

V. CONCLUSION 



DNA cryptography is encrypting the message in tenns of DNA sequences. This can be done using several DNA 
technologies with the biochemical methods. This method can also be interpreted with other schemes in order to 
apply this technology in various fields. In this paper we use codons and DNA sequence to encrypt the message. 

• For normal text encryption we use only 27 characters. These can be picked from 64 codons in 64C 27 ways. 

• The remaining 64 - 27 = 37 codons can be used to generate the DNA sequence for insertion technique. 

• The message DNA is inserted into another DNA sequence. So it is difficult to determine which part is the 
encrypted DNA 

• Even if the encrypted DNA is filtered it is not possible to decrypt the message unless it is known how the 
table is constructed. 

The original insertion method is modified to random one. Also numerous DNA sequences are available in public 
domain. It is difficult to find the difference between the original and the encrypted one. So we conclude that the 
proposed method is safe for message transfer. 
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